Automatic prosodic labeling of accent information for Japanese spoken sentences
Abstract
This paper describes a method for automatically labeling prosodic information in Japanese spoken sentences, focusing on accent types and accent phrase boundaries. Both are predicted with conditional random fields (CRFs) using linguistic information and F0 contour information. For accent type prediction, we propose a method that uses a provisional accent type derived from linguistic information and accentuation rules. The actual accent type is then predicted from F0 information and linguistic information, which includes the provisional accent type as one of its features, under the condition that the speech content and the accent phrase boundaries are given. Evaluation experiments show that introducing accentuation rules improves the accuracy of accent type prediction by 6.1% and that the prediction rate is 59.6% for spontaneous Japanese speech data. For accent phrase boundary prediction, we propose a method that uses linguistic and prosodic probability models under the condition that the speech content and word labels are given. The prediction accuracy for accent phrase boundaries is 76.5%.
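As a rough illustration of how a provisional accent type can enter a CRF as one feature alongside F0 information, the sketch below builds per-accent-phrase feature dictionaries of the kind that linear-chain CRF toolkits commonly take as input. It is not the authors' feature set: the record layout, the feature names, and the F0 summary statistics (`f0_mean`, `f0_range`, `f0_peak_position`) are all assumptions made for this example, and no CRF is trained here.

```python
from statistics import mean


def phrase_features(phrase):
    """Build a feature dict for one accent phrase.

    `phrase` is a hypothetical record (not from the paper) with keys:
      words: list of (surface, part_of_speech) pairs
      mora_count: number of morae in the phrase
      provisional_accent: accent type given by accentuation rules
      f0: list of log-F0 samples from voiced frames
    """
    f0 = phrase["f0"]
    # Index of the F0 maximum, later normalised to the range [0, 1].
    peak_index = max(range(len(f0)), key=f0.__getitem__)
    return {
        # Linguistic features (assumed for illustration).
        "mora_count": phrase["mora_count"],
        "head_pos": phrase["words"][0][1],
        # The provisional accent type is just another categorical feature.
        "provisional_accent": str(phrase["provisional_accent"]),
        # Simple F0 contour summaries (assumed for illustration).
        "f0_mean": mean(f0),
        "f0_range": max(f0) - min(f0),
        "f0_peak_position": peak_index / (len(f0) - 1) if len(f0) > 1 else 0.0,
    }


def sentence_features(phrases):
    """Return one feature dict per accent phrase, with left context added."""
    feats = [phrase_features(p) for p in phrases]
    for i, feat in enumerate(feats):
        # Context feature: provisional accent of the preceding phrase.
        feat["prev_provisional_accent"] = (
            feats[i - 1]["provisional_accent"] if i > 0 else "BOS"
        )
    return feats


# Toy input: two accent phrases with made-up F0 values.
example = [
    {"words": [("ame", "noun"), ("ga", "particle")], "mora_count": 3,
     "provisional_accent": 1, "f0": [5.0, 5.4, 5.1, 4.9]},
    {"words": [("furu", "verb")], "mora_count": 2,
     "provisional_accent": 1, "f0": [4.9, 5.0, 4.7]},
]
features = sentence_features(example)
```

Each sentence becomes a sequence of such dictionaries, and the reference accent types form the matching label sequence. Because the accent phrase boundaries are assumed to be given, the sequence unit here is the accent phrase.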
Related resources
Detecting accent sandhi in Japanese using a superpositional F0 model
In this report, we propose a method for automatic prosodic structure recognition of Japanese utterances based on a superpositional F0 model, focusing particularly on the accent sandhi phenomenon in compound nouns. The method enables automatic labeling of F0 contours using the model, which can be useful for creating prosodic databases containing F0 contours in a parametric form. The prosodic str...
Automatic Scoring for Prosodic Proficiency of English Sentences Spoken by Japanese Based on Utterance Comparison
This paper describes techniques of scoring prosodic proficiency of English sentences spoken by Japanese. The multiple regression model predicts the prosodic proficiency using new prosodic measures based on the characteristics of Japanese novice learners of English. Prosodic measures are calculated by comparing prosodic parameters, such as F0, power and duration, of learner’s and native speaker’...
Detection of Non-Native Named Entities Using Prosodic Features for Improved Speech Recognition and Translation
In this work, we describe the use of acoustic-prosodic features to detect and localize non-native named entities spoken by a native speaker in the target language (English) for the purpose of improved speech recognition and translation. The exaggerated variation in accent and duration introduced by the speaker for non-native names is exploited in the detection process through the use of prosodi...
Accent type and phrase boundary estimation using acoustic and language models for automatic prosodic labeling
This paper proposes an automatic prosodic labeling technique for constructing speech database used for speech synthesis. In the corpus-based Japanese speech synthesis, it is essential to use annotated speech data with prosodic information such as phrase boundaries and accent types. However, manual annotation is generally time-consuming and expensive. To overcome this problem, we propose an esti...
Combining acoustic, lexical, and syntactic evidence for automatic unsupervised prosody labeling
Automatic labeling of prosodic events in speech has potentially significant implications for spoken language processing applications, and has received much attention over the years, especially after the introduction of annotation standards such as ToBI. Current labeling techniques are based on supervised learning, relying on the availability of a corpus that is annotated with the prosodic label...